# High-Fidelity Speech Synthesis
Parler TTS Large V1
Apache-2.0
A 2.2-billion-parameter text-to-speech model trained on 45,000 hours of audio data that supports controlling voice characteristics through text prompts.
Speech Synthesis
Transformers English

parler-tts
28.69k
252
Vocos Mel HiFi-GAN Compat 44100khz
MIT
Vocos is a fast neural vocoder that reconstructs audio efficiently by generating spectral coefficients rather than time-domain samples, which makes it well suited to text-to-speech tasks.
Speech Synthesis
TensorBoard Other

patriotyk
2,222
10
Amadeus
A Japanese text-to-speech (TTS) model based on the VITS architecture, trained by mio on the amadeus dataset with the ESPnet2 framework.
Speech Synthesis Japanese
mio
37
85
Gunnarthor Talromur A FastSpeech2
A FastSpeech2 text-to-speech model for Icelandic speech synthesis, trained on the talromur dataset with the ESPnet framework.
Speech Synthesis Icelandic
espnet
50
0